Skip to content

[3.14] gh-138907: Support RFC 9309 in robotparser (GH-138908)#149374

Merged
serhiy-storchaka merged 1 commit intopython:3.14from
miss-islington:backport-bc285e5-3.14
May 4, 2026
Merged

[3.14] gh-138907: Support RFC 9309 in robotparser (GH-138908)#149374
serhiy-storchaka merged 1 commit intopython:3.14from
miss-islington:backport-bc285e5-3.14

Conversation

@miss-islington
Copy link
Copy Markdown
Contributor

@miss-islington miss-islington commented May 4, 2026

  • empty lines are always ignored instead of separating groups
  • the "user-agent" line after a rule starts a new group
  • groups matching the same user agent are now merged
  • the rule with the longest match wins instead of the first matching rule
  • in case of equal matches, the “Allow” rule wins over “Disallow”
  • special characters “$” and “*” are now supported in rules
  • prefer full match for user agent
    (cherry picked from commit bc285e5)

Co-authored-by: Serhiy Storchaka storchaka@gmail.com

* empty lines are always ignored instead of separating groups
* the "user-agent" line after a rule starts a new group
* groups matching the same user agent are now merged
* the rule with the longest match wins instead of the first matching rule
* in case of equal matches, the “Allow” rule wins over “Disallow”
* special characters “$” and “*” are now supported in rules
* prefer full match for user agent
(cherry picked from commit bc285e5)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
@serhiy-storchaka serhiy-storchaka enabled auto-merge (squash) May 4, 2026 18:21
@serhiy-storchaka serhiy-storchaka merged commit 3b0a3c4 into python:3.14 May 4, 2026
50 checks passed
@miss-islington miss-islington deleted the backport-bc285e5-3.14 branch May 4, 2026 18:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants